Overview

Dataset statistics

Number of variables27
Number of observations10000
Missing cells212
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.4 MiB
Average record size in memory877.1 B

Variable types

NUM12
CAT10
BOOL3
UNSUPPORTED1
DATE1

Reproduction

Analysis started2020-05-23 15:35:04.011028
Analysis finished2020-05-23 15:36:42.692804
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
moramaxima_12_meses_1 is highly correlated with moramaxima_12_mesesHigh Correlation
moramaxima_12_meses is highly correlated with moramaxima_12_meses_1High Correlation
sector has 120 (1.2%) missing values Missing
ingresos is highly skewed (γ1 = 40.05295954) Skewed
tiempactivano is highly skewed (γ1 = 50.65164527) Skewed
calificacionsistema_financiero is an unsupported type, check if it needs cleaning or further analysis Rejected
ingresos has 255 (2.5%) zeros Zeros
personascargo has 7507 (75.1%) zeros Zeros
tiempactivano has 584 (5.8%) zeros Zeros
moramaxima_12_meses has 4222 (42.2%) zeros Zeros
moramaxima_12_meses_1 has 4222 (42.2%) zeros Zeros
antiguedad_en_el_sistema_financiero has 3368 (33.7%) zeros Zeros
numero_de_creditos_vigentes has 8955 (89.5%) zeros Zeros

Variables

cliente
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count10000
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5000.5
Minimum1
Maximum10000
Zeros0
Zeros (%)0.0%
Memory size78.2 KiB

Quantile statistics

Minimum1
5-th percentile500.95
Q12500.75
median5000.5
Q37500.25
95-th percentile9500.05
Maximum10000
Range9999
Interquartile range (IQR)4999.5

Descriptive statistics

Standard deviation2886.89568
Coefficient of variation (CV)0.5773214038
Kurtosis-1.2
Mean5000.5
Median Absolute Deviation (MAD)2500
Skewness0
Sum50005000
Variance8334166.667
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.e+00 1.e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
5424 1 < 0.1%
 
1338 1 < 0.1%
 
7481 1 < 0.1%
 
5432 1 < 0.1%
 
9526 1 < 0.1%
 
3379 1 < 0.1%
 
1330 1 < 0.1%
 
7473 1 < 0.1%
 
9518 1 < 0.1%
 
Other values (9990) 9990 99.9%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
10000 1 < 0.1%
 
9999 1 < 0.1%
 
9998 1 < 0.1%
 
9997 1 < 0.1%
 
9996 1 < 0.1%
 

mora30
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
0
7641
1
2359
ValueCountFrequency (%) 
0 7641 76.4%
 
1 2359 23.6%
 

mora60
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
0
8867
1
 
1133
ValueCountFrequency (%) 
0 8867 88.7%
 
1 1133 11.3%
 

segmento
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
MDO
6838
STD
2389
VIP
 
484
MPY
 
157
PY
 
132
ValueCountFrequency (%) 
MDO 6838 68.4%
 
STD 2389 23.9%
 
VIP 484 4.8%
 
MPY 157 1.6%
 
PY 132 1.3%
 

Length

Max length3
Mean length2.9868
Min length2
ValueCountFrequency (%) 
Uppercase_Letter 9 100.0%
 
ValueCountFrequency (%) 
Latin 9 100.0%
 
ValueCountFrequency (%) 
ASCII 9 100.0%
 

sector
Categorical

MISSING
Distinct count10
Unique (%)0.1%
Missing120
Missing (%)1.2%
Memory size78.2 KiB
PERSONAS NATURALES
9467
COMERCIO
 
162
SERVICIOS
 
154
AGROPECUARIO
 
49
MANUFACTURA
 
34
Other values (5)
 
14
ValueCountFrequency (%) 
PERSONAS NATURALES 9467 94.7%
 
COMERCIO 162 1.6%
 
SERVICIOS 154 1.5%
 
AGROPECUARIO 49 0.5%
 
MANUFACTURA 34 0.3%
 
SERVICIOS FINANCIEROS 7 0.1%
 
EDIFICACIONES 4 < 0.1%
 
GOBIERNO 1 < 0.1%
 
RECURSOS NATURALES 1 < 0.1%
 
INFRAESTRUCTURA 1 < 0.1%
 
(Missing) 120 1.2%
 

Length

Max length21
Mean length17.465
Min length3
ValueCountFrequency (%) 
Uppercase_Letter 18 85.7%
 
Lowercase_Letter 2 9.5%
 
Space_Separator 1 4.8%
 
ValueCountFrequency (%) 
Latin 20 95.2%
 
Common 1 4.8%
 
ValueCountFrequency (%) 
ASCII 21 100.0%
 

regcons
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
ESPECIAL
2952
OCCIDENTE
2750
CENTRO
1694
SUR
1494
NORTE
1110
ValueCountFrequency (%) 
ESPECIAL 2952 29.5%
 
OCCIDENTE 2750 27.5%
 
CENTRO 1694 16.9%
 
SUR 1494 14.9%
 
NORTE 1110 11.1%
 

Length

Max length9
Mean length6.8562
Min length3
ValueCountFrequency (%) 
Uppercase_Letter 13 100.0%
 
ValueCountFrequency (%) 
Latin 13 100.0%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

fdesem
Date

Distinct count1029
Unique (%)10.3%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
Minimum2011-03-02 00:00:00
Maximum2016-02-29 00:00:00
Histogram

ingresos
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count2906
Unique (%)29.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.86068213
Minimum0
Maximum4527.398
Zeros255
Zeros (%)2.5%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0.45
Q10.9201875
median1.1684035
Q31.284
95-th percentile1.388005
Maximum4527.398
Range4527.398
Interquartile range (IQR)0.3638125

Descriptive statistics

Standard deviation79.0319445
Coefficient of variation (CV)16.25943487
Kurtosis1831.176937
Mean4.86068213
Median Absolute Deviation (MAD)7.345154414
Skewness40.05295954
Sum48606.8213
Variance6246.048252
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.00000000e-07 1.50000000e-06 1.55000000e-05 1.10000000e-02 ... 3.33903330e+01 5.53839580e+01 1.10161000e+02 5.18653653e+02 4.52739800e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1.2 1239 12.4%
 
1.3 689 6.9%
 
1 516 5.2%
 
1.1 279 2.8%
 
0 255 2.5%
 
0.64435 242 2.4%
 
0.8 178 1.8%
 
0.9 174 1.7%
 
1.25 143 1.4%
 
0.616 136 1.4%
 
Other values (2896) 6149 61.5%
 
ValueCountFrequency (%) 
0 255 2.5%
 
1e-06 92 0.9%
 
2e-06 1 < 0.1%
 
1e-05 1 < 0.1%
 
2.1e-05 1 < 0.1%
 
ValueCountFrequency (%) 
4527.398 1 < 0.1%
 
3079.894 3 < 0.1%
 
1961.198031 1 < 0.1%
 
1451.898583 3 < 0.1%
 
750 1 < 0.1%
 

personascargo
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3769
Minimum0
Maximum6
Zeros7507
Zeros (%)75.1%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7621708998
Coefficient of variation (CV)2.022209869
Kurtosis6.595041672
Mean0.3769
Median Absolute Deviation (MAD)0.56587766
Skewness2.399817814
Sum3769
Variance0.5809044804
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 3.5 4.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 7507 75.1%
 
1 1555 15.6%
 
2 692 6.9%
 
3 176 1.8%
 
4 50 0.5%
 
5 18 0.2%
 
6 2 < 0.1%
 
ValueCountFrequency (%) 
0 7507 75.1%
 
1 1555 15.6%
 
2 692 6.9%
 
3 176 1.8%
 
4 50 0.5%
 
ValueCountFrequency (%) 
6 2 < 0.1%
 
5 18 0.2%
 
4 50 0.5%
 
3 176 1.8%
 
2 692 6.9%
 

gastos
Real number (ℝ≥0)

Distinct count1228
Unique (%)12.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3890325201
Minimum0.003214285714
Maximum0.9
Zeros0
Zeros (%)0.0%
Memory size78.2 KiB

Quantile statistics

Minimum0.003214285714
5-th percentile0.308
Q10.308
median0.374726
Q30.44975
95-th percentile0.55
Maximum0.9
Range0.8967857143
Interquartile range (IQR)0.14175

Descriptive statistics

Standard deviation0.08429324901
Coefficient of variation (CV)0.216674043
Kurtosis1.128621063
Mean0.3890325201
Median Absolute Deviation (MAD)0.06913968457
Skewness0.6516273447
Sum3890.325201
Variance0.007105351828
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00321429 0.16625 0.29982143 0.304 0.308032 ... 0.5779875 0.5975 0.600054 0.618 0.9 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.308 2646 26.5%
 
0.4 632 6.3%
 
0.35 388 3.9%
 
0.5 360 3.6%
 
0.45 320 3.2%
 
0.375 188 1.9%
 
0.6 160 1.6%
 
0.425 158 1.6%
 
0.344 158 1.6%
 
0.55 118 1.2%
 
Other values (1218) 4872 48.7%
 
ValueCountFrequency (%) 
0.003214285714 6 0.1%
 
0.03285714286 6 0.1%
 
0.0625 6 0.1%
 
0.09214285714 8 0.1%
 
0.12 2 < 0.1%
 
ValueCountFrequency (%) 
0.9 6 0.1%
 
0.8 2 < 0.1%
 
0.7 2 < 0.1%
 
0.68 2 < 0.1%
 
0.65 4 < 0.1%
 

tiempactivano
Real number (ℝ≥0)

SKEWED
ZEROS
Distinct count36
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean386.1667
Minimum0
Maximum1050000
Zeros584
Zeros (%)5.8%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q33
95-th percentile9
Maximum1050000
Range1050000
Interquartile range (IQR)2

Descriptive statistics

Standard deviation19253.15957
Coefficient of variation (CV)49.85712019
Kurtosis2585.928623
Mean386.1667
Median Absolute Deviation (MAD)766.4430666
Skewness50.65164527
Sum3861667
Variance370684153.6
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00e+00 5.00e-01 1.50e+00 2.50e+00 3.50e+00 ... 2.95e+01 3.05e+01 3.70e+01 5.70e+01 1.05e+06], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 4533 45.3%
 
2 1880 18.8%
 
3 852 8.5%
 
0 584 5.8%
 
4 533 5.3%
 
5 433 4.3%
 
6 250 2.5%
 
7 200 2.0%
 
8 142 1.4%
 
10 124 1.2%
 
Other values (26) 469 4.7%
 
ValueCountFrequency (%) 
0 584 5.8%
 
1 4533 45.3%
 
2 1880 18.8%
 
3 852 8.5%
 
4 533 5.3%
 
ValueCountFrequency (%) 
1050000 2 < 0.1%
 
866880 2 < 0.1%
 
72 4 < 0.1%
 
42 2 < 0.1%
 
32 4 < 0.1%
 

ocupacion
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
EMPLEADO
9550
JUBILADOS/PENSIONADO
 
444
PROFESIONAL INDEPENDIENTE
 
6
ValueCountFrequency (%) 
EMPLEADO 9550 95.5%
 
JUBILADOS/PENSIONADO 444 4.4%
 
PROFESIONAL INDEPENDIENTE 6 0.1%
 

Length

Max length25
Mean length8.543
Min length8
ValueCountFrequency (%) 
Uppercase_Letter 16 88.9%
 
Space_Separator 1 5.6%
 
Other_Punctuation 1 5.6%
 
ValueCountFrequency (%) 
Latin 16 88.9%
 
Common 2 11.1%
 
ValueCountFrequency (%) 
ASCII 18 100.0%
 

tipcontrato
Categorical

Distinct count7
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
TÉRMINO INDEFINIDO
7038
TÉRMINO FIJO
1734
OBRA, LABOR O MISIÓN
 
744
OTROS
 
446
LIBRE NOMBRAMIENTO O REMOCIÓN
 
16
Other values (2)
 
22
ValueCountFrequency (%) 
TÉRMINO INDEFINIDO 7038 70.4%
 
TÉRMINO FIJO 1734 17.3%
 
OBRA, LABOR O MISIÓN 744 7.4%
 
OTROS 446 4.5%
 
LIBRE NOMBRAMIENTO O REMOCIÓN 16 0.2%
 
CARRERA ADMINISTRATIVA 14 0.1%
 
NOMBRAMIENTO PROVISIONAL 8 0.1%
 

Length

Max length29
Mean length16.5566
Min length5
ValueCountFrequency (%) 
Uppercase_Letter 19 90.5%
 
Other_Punctuation 1 4.8%
 
Space_Separator 1 4.8%
 
ValueCountFrequency (%) 
Latin 19 90.5%
 
Common 2 9.5%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

edad
Real number (ℝ≥0)

Distinct count52
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.6953
Minimum18
Maximum69
Zeros0
Zeros (%)0.0%
Memory size78.2 KiB

Quantile statistics

Minimum18
5-th percentile22
Q127
median30
Q338
95-th percentile57
Maximum69
Range51
Interquartile range (IQR)11

Descriptive statistics

Standard deviation10.30236244
Coefficient of variation (CV)0.305750726
Kurtosis1.060081067
Mean33.6953
Median Absolute Deviation (MAD)8.0212968
Skewness1.256705791
Sum336953
Variance106.1386718
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[18. 19.5 20.5 22.5 24.5 ... 47.5 48.5 51.5 63.5 69. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
27 892 8.9%
 
26 892 8.9%
 
28 656 6.6%
 
29 575 5.8%
 
30 500 5.0%
 
31 450 4.5%
 
25 431 4.3%
 
32 416 4.2%
 
33 336 3.4%
 
34 312 3.1%
 
Other values (42) 4540 45.4%
 
ValueCountFrequency (%) 
18 8 0.1%
 
19 38 0.4%
 
20 140 1.4%
 
21 222 2.2%
 
22 190 1.9%
 
ValueCountFrequency (%) 
69 22 0.2%
 
68 20 0.2%
 
67 20 0.2%
 
66 18 0.2%
 
65 26 0.3%
 

estado_civil
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
SOLTERO
6323
CASADO
1847
OTROS
1562
DIVORCIADO
 
188
VIUDO
 
80
ValueCountFrequency (%) 
SOLTERO 6323 63.2%
 
CASADO 1847 18.5%
 
OTROS 1562 15.6%
 
DIVORCIADO 188 1.9%
 
VIUDO 80 0.8%
 

Length

Max length10
Mean length6.5433
Min length5
ValueCountFrequency (%) 
Uppercase_Letter 12 100.0%
 
ValueCountFrequency (%) 
Latin 12 100.0%
 
ValueCountFrequency (%) 
ASCII 12 100.0%
 

genero
Categorical

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
MASCULINO
5505
FEMENINO
4495
ValueCountFrequency (%) 
MASCULINO 5505 55.0%
 
FEMENINO 4495 45.0%
 

Length

Max length9
Mean length8.5505
Min length8
ValueCountFrequency (%) 
Uppercase_Letter 11 100.0%
 
ValueCountFrequency (%) 
Latin 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 

ingresos_totales
Real number (ℝ≥0)

Distinct count1599
Unique (%)16.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.7911684244
Minimum0.6
Maximum1.232
Zeros0
Zeros (%)0.0%
Memory size78.2 KiB

Quantile statistics

Minimum0.6
5-th percentile0.616
Q10.636
median0.762145
Q30.9
95-th percentile1.1
Maximum1.232
Range0.632
Interquartile range (IQR)0.264

Descriptive statistics

Standard deviation0.1595499484
Coefficient of variation (CV)0.2016636958
Kurtosis-0.4433238676
Mean0.7911684244
Median Absolute Deviation (MAD)0.133902923
Skewness0.6853864304
Sum7911.684244
Variance0.02545618604
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.6 0.608 0.6160255 0.6160275 0.616064 ... 1.1007295 1.155975 1.1954025 1.200108 1.232 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.616 2078 20.8%
 
0.8 452 4.5%
 
0.7 344 3.4%
 
0.9 296 3.0%
 
1 294 2.9%
 
0.75 194 1.9%
 
0.688 172 1.7%
 
0.85 164 1.6%
 
1.2 126 1.3%
 
1.1 124 1.2%
 
Other values (1589) 5756 57.6%
 
ValueCountFrequency (%) 
0.6 4 < 0.1%
 
0.616 2078 20.8%
 
0.616019 2 < 0.1%
 
0.616024 2 < 0.1%
 
0.616027 34 0.3%
 
ValueCountFrequency (%) 
1.232 4 < 0.1%
 
1.2316 2 < 0.1%
 
1.2313 2 < 0.1%
 
1.23 2 < 0.1%
 
1.229 2 < 0.1%
 

nivel_academico
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
BACHILLER
3235
TECNÓLOGO
3213
UNIVERSITARIO
2988
OTROS
 
428
ESPECIALIZACIÓN
 
136
ValueCountFrequency (%) 
BACHILLER 3235 32.4%
 
TECNÓLOGO 3213 32.1%
 
UNIVERSITARIO 2988 29.9%
 
OTROS 428 4.3%
 
ESPECIALIZACIÓN 136 1.4%
 

Length

Max length15
Mean length10.1056
Min length5
ValueCountFrequency (%) 
Uppercase_Letter 18 100.0%
 
ValueCountFrequency (%) 
Latin 18 100.0%
 
ValueCountFrequency (%) 
ASCII 17 100.0%
 

tipo_vivienda
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
FAMILIAR
8284
PROPIA
 
1122
ALQUILADA
 
594
ValueCountFrequency (%) 
FAMILIAR 8284 82.8%
 
PROPIA 1122 11.2%
 
ALQUILADA 594 5.9%
 

Length

Max length9
Mean length7.835
Min length6
ValueCountFrequency (%) 
Uppercase_Letter 11 100.0%
 
ValueCountFrequency (%) 
Latin 11 100.0%
 
ValueCountFrequency (%) 
ASCII 11 100.0%
 
Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
A
9594
B
 
122
D
 
106
E
 
102
C
 
76
ValueCountFrequency (%) 
A 9594 95.9%
 
B 122 1.2%
 
D 106 1.1%
 
E 102 1.0%
 
C 76 0.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Uppercase_Letter 5 100.0%
 
ValueCountFrequency (%) 
Latin 5 100.0%
 
ValueCountFrequency (%) 
ASCII 5 100.0%
 

calificacionsistema_financiero
Unsupported

REJECTED
UNSUPPORTED
Missing92
Missing (%)0.9%
Memory size78.2 KiB

moramaxima_12_meses
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count91
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.0129
Minimum0
Maximum364
Zeros4222
Zeros (%)42.2%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median17
Q330
95-th percentile107
Maximum364
Range364
Interquartile range (IQR)30

Descriptive statistics

Standard deviation40.10702172
Coefficient of variation (CV)1.541812782
Kurtosis12.11807548
Mean26.0129
Median Absolute Deviation (MAD)26.5646022
Skewness2.96156024
Sum260129
Variance1608.573191
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 7. 14.5 15.5 ... 197.5 200.5 228.5 229.5 364. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4222 42.2%
 
17 1366 13.7%
 
30 549 5.5%
 
47 375 3.8%
 
16 324 3.2%
 
18 300 3.0%
 
48 286 2.9%
 
28 230 2.3%
 
29 190 1.9%
 
13 164 1.6%
 
Other values (81) 1994 19.9%
 
ValueCountFrequency (%) 
0 4222 42.2%
 
1 36 0.4%
 
13 164 1.6%
 
14 42 0.4%
 
15 136 1.4%
 
ValueCountFrequency (%) 
364 2 < 0.1%
 
351 4 < 0.1%
 
321 4 < 0.1%
 
320 2 < 0.1%
 
304 2 < 0.1%
 
Distinct count9262
Unique (%)92.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.7122772989
Minimum0
Maximum1
Zeros5
Zeros (%)0.1%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0.155959325
Q10.563666625
median0.7992201363
Q30.9209716541
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)0.3573050291

Descriptive statistics

Standard deviation0.2606009835
Coefficient of variation (CV)0.3658701237
Kurtosis-0.05900857697
Mean0.7122772989
Median Absolute Deviation (MAD)0.2135547169
Skewness-0.9821633154
Sum7122.772989
Variance0.06791287258
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.00431405 0.02260463 0.07008375 0.1147331 ... 0.98192354 0.99101755 0.99174735 0.99998087 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 537 5.4%
 
0.8987006667 8 0.1%
 
0.940385 8 0.1%
 
0.974074 6 0.1%
 
0.9194576 5 0.1%
 
0.9607115 5 0.1%
 
0 5 0.1%
 
0.8597823333 4 < 0.1%
 
0.9819233333 4 < 0.1%
 
0.933133 4 < 0.1%
 
Other values (9252) 9414 94.1%
 
ValueCountFrequency (%) 
0 5 0.1%
 
0.0009995121951 1 < 0.1%
 
0.001943243243 1 < 0.1%
 
0.002180869565 1 < 0.1%
 
0.002213166667 1 < 0.1%
 
ValueCountFrequency (%) 
1 537 5.4%
 
0.9999617366 1 < 0.1%
 
0.9968665 1 < 0.1%
 
0.995 2 < 0.1%
 
0.9944965 1 < 0.1%
 

moramaxima_12_meses_1
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count91
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.0129
Minimum0
Maximum364
Zeros4222
Zeros (%)42.2%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median17
Q330
95-th percentile107
Maximum364
Range364
Interquartile range (IQR)30

Descriptive statistics

Standard deviation40.10702172
Coefficient of variation (CV)1.541812782
Kurtosis12.11807548
Mean26.0129
Median Absolute Deviation (MAD)26.5646022
Skewness2.96156024
Sum260129
Variance1608.573191
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 7. 14.5 15.5 ... 197.5 200.5 228.5 229.5 364. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4222 42.2%
 
17 1366 13.7%
 
30 549 5.5%
 
47 375 3.8%
 
16 324 3.2%
 
18 300 3.0%
 
48 286 2.9%
 
28 230 2.3%
 
29 190 1.9%
 
13 164 1.6%
 
Other values (81) 1994 19.9%
 
ValueCountFrequency (%) 
0 4222 42.2%
 
1 36 0.4%
 
13 164 1.6%
 
14 42 0.4%
 
15 136 1.4%
 
ValueCountFrequency (%) 
364 2 < 0.1%
 
351 4 < 0.1%
 
321 4 < 0.1%
 
320 2 < 0.1%
 
304 2 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
0
6540
1
3460
ValueCountFrequency (%) 
0 6540 65.4%
 
1 3460 34.6%
 

antiguedad_en_el_sistema_financiero
Real number (ℝ≥0)

ZEROS
Distinct count138
Unique (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.3318
Minimum0
Maximum339
Zeros3368
Zeros (%)33.7%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q314
95-th percentile56
Maximum339
Range339
Interquartile range (IQR)14

Descriptive statistics

Standard deviation21.91461462
Coefficient of variation (CV)1.777081579
Kurtosis24.20428634
Mean12.3318
Median Absolute Deviation (MAD)13.55891092
Skewness3.833282619
Sum123318
Variance480.2503338
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 4.5 6.5 ... 96.5 117. 144. 215. 339. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 3368 33.7%
 
2 509 5.1%
 
3 476 4.8%
 
4 436 4.4%
 
5 378 3.8%
 
1 338 3.4%
 
6 322 3.2%
 
7 286 2.9%
 
8 254 2.5%
 
10 240 2.4%
 
Other values (128) 3393 33.9%
 
ValueCountFrequency (%) 
0 3368 33.7%
 
1 338 3.4%
 
2 509 5.1%
 
3 476 4.8%
 
4 436 4.4%
 
ValueCountFrequency (%) 
339 2 < 0.1%
 
217 2 < 0.1%
 
213 2 < 0.1%
 
202 2 < 0.1%
 
200 2 < 0.1%
 

numero_de_creditos_vigentes
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1235
Minimum0
Maximum5
Zeros8955
Zeros (%)89.5%
Memory size78.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3973204454
Coefficient of variation (CV)3.217169599
Kurtosis23.76677782
Mean0.1235
Median Absolute Deviation (MAD)0.2211885
Skewness4.126298584
Sum1235
Variance0.1578635364
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 3.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 8955 89.5%
 
1 897 9.0%
 
2 118 1.2%
 
3 22 0.2%
 
5 4 < 0.1%
 
4 4 < 0.1%
 
ValueCountFrequency (%) 
0 8955 89.5%
 
1 897 9.0%
 
2 118 1.2%
 
3 22 0.2%
 
4 4 < 0.1%
 
ValueCountFrequency (%) 
5 4 < 0.1%
 
4 4 < 0.1%
 
3 22 0.2%
 
2 118 1.2%
 
1 897 9.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

clientemora30mora60segmentosectorregconsfdesemingresospersonascargogastostiempactivanoocupaciontipcontratoedadestado_civilgeneroingresos_totalesnivel_academicotipo_viviendacalificacion_superfinancieracalificacionsistema_financieromoramaxima_12_meses%deuda_actual_sistema_financieromoramaxima_12_meses_1experiencia_financieraantiguedad_en_el_sistema_financieronumero_de_creditos_vigentes
0100PYSERVICIOSESPECIAL2013-11-200.00000000.3000EMPLEADOTÉRMINO INDEFINIDO24SOLTEROFEMENINO0.600UNIVERSITARIOFAMILIARANaN00.6367420040
1211MPYAGROPECUARIOOCCIDENTE2015-12-161.50000030.3004EMPLEADOTÉRMINO FIJO30OTROSMASCULINO0.600BACHILLERFAMILIARA666610.934640611100
2300PYPERSONAS NATURALESOCCIDENTE2013-05-082.05826900.3081EMPLEADOTÉRMINO INDEFINIDO37SOLTEROMASCULINO0.616TECNÓLOGOFAMILIARANaN00.1172590090
3400MPYAGROPECUARIOCENTRO2015-11-113.50000010.3086EMPLEADOTÉRMINO FIJO38SOLTEROFEMENINO0.616BACHILLERFAMILIARA658291.00000029180
4500MPYAGROPECUARIOSUR2013-12-133.65500000.3083EMPLEADOTÉRMINO INDEFINIDO23SOLTEROFEMENINO0.616UNIVERSITARIOFAMILIARA765170.354826170120
5600MPYAGROPECUARIOOCCIDENTE2015-10-153.83149300.3085EMPLEADOLIBRE NOMBRAMIENTO O REMOCIÓN52OTROSFEMENINO0.616BACHILLERFAMILIARA697270.866539271171
6700MDOAGROPECUARIOSUR2015-12-104.00000000.3084EMPLEADOTÉRMINO INDEFINIDO29SOLTEROFEMENINO0.616TECNÓLOGOALQUILADAA73900.9620870000
7800MDOPERSONAS NATURALESESPECIAL2012-06-285.20000040.3083EMPLEADOTÉRMINO INDEFINIDO35CASADOMASCULINO0.616BACHILLERALQUILADAA84800.3442230130
8900PYAGROPECUARIOCENTRO2013-10-087.71600010.3084EMPLEADOTÉRMINO FIJO36DIVORCIADOFEMENINO0.616BACHILLERFAMILIARA.270.33195427080
91011PYAGROPECUARIOCENTRO2013-12-127.71600000.3081EMPLEADOTÉRMINO INDEFINIDO20SOLTEROMASCULINO0.616BACHILLERFAMILIARA.900.33331190000

Last rows

clientemora30mora60segmentosectorregconsfdesemingresospersonascargogastostiempactivanoocupaciontipcontratoedadestado_civilgeneroingresos_totalesnivel_academicotipo_viviendacalificacion_superfinancieracalificacionsistema_financieromoramaxima_12_meses%deuda_actual_sistema_financieromoramaxima_12_meses_1experiencia_financieraantiguedad_en_el_sistema_financieronumero_de_creditos_vigentes
9990999100MDOPERSONAS NATURALESCENTRO2015-07-031.39500020.3081EMPLEADOTÉRMINO INDEFINIDO29OTROSMASCULINO1.170051BACHILLERFAMILIARA71900.8542180000
9991999200MDOPERSONAS NATURALESNORTE2015-10-051.39500000.3085EMPLEADOTÉRMINO INDEFINIDO42SOLTEROMASCULINO1.190805UNIVERSITARIOFAMILIARA69700.946871002170
9992999310MDOPERSONAS NATURALESESPECIAL2013-05-061.39500020.30815EMPLEADOTÉRMINO INDEFINIDO35CASADOFEMENINO1.202463TECNÓLOGOFAMILIARA659470.11040147020
9993999400MDOPERSONAS NATURALESESPECIAL2013-04-171.39500010.3084EMPLEADOTÉRMINO INDEFINIDO25OTROSMASCULINO1.205505UNIVERSITARIOFAMILIARA84800.07208101160
9994999500MDOPERSONAS NATURALESESPECIAL2015-07-211.39500000.3082EMPLEADOTÉRMINO INDEFINIDO26OTROSFEMENINO1.207553UNIVERSITARIOFAMILIARA719300.85863530021
9995999600MDOPERSONAS NATURALESOCCIDENTE2015-08-101.39500010.3081EMPLEADOTÉRMINO INDEFINIDO39CASADOFEMENINO1.222883TECNÓLOGOFAMILIARA60500.8792270111
9996999711MDOPERSONAS NATURALESOCCIDENTE2015-04-271.39500020.3081EMPLEADOTÉRMINO INDEFINIDO29OTROSMASCULINO1.170051BACHILLERFAMILIARA831470.841957470160
9997999800MDOPERSONAS NATURALESCENTRO2015-06-091.39526400.3085EMPLEADOTÉRMINO INDEFINIDO42SOLTEROMASCULINO1.190805UNIVERSITARIOFAMILIARA72800.0513930121
9998999900MDOPERSONAS NATURALESCENTRO2015-09-071.39526420.30815EMPLEADOTÉRMINO INDEFINIDO35CASADOFEMENINO1.202463TECNÓLOGOFAMILIARA584300.90212830011
99991000000MDOPERSONAS NATURALESCENTRO2015-08-261.39537710.3084EMPLEADOTÉRMINO INDEFINIDO25OTROSMASCULINO1.205505UNIVERSITARIOFAMILIARA69700.9237310111